Reinforcement Learning Inspired Disturbance Rejection and Nao Bipedal Locomotion
نویسنده
چکیده
Competitive bipedal soccer playing robots need to move fast and react quickly to changes in direction while staying upright. This paper describes the application of reinforcement learning to stabilise a flat-footed humanoid robot. An optimal control policy is learned using a physics simulator. The learned policy is supported theoretically and interpreted on a real robot as a linearised continuous control function. The paper also describes other components, including foot-step coordination, of bipedal locomotion integrated to achieve reactive omni-directional locomotion for Nao robots used in the RoboCup Standard Platform League.
منابع مشابه
Bipedal Locomotion Primitive Learning, Control and Prediction from Human Data
At the current stage bipedal robot locomotion is quite different from human walking. Imitation learning framework from human demonstrations is an efficient approach to lead towards human-like behaviors. This paper addresses a framework for real-time wholebody human motion imitation by a humanoid robot. The framework is a structured mixture of whole body motion control, learning and prediction. ...
متن کاملLateral Disturbance Rejection for the Nao Robot
Maintaining balance in the presence of disturbances is crucial for bipedal robots. In this paper, we focus on the lateral motion component. In order to attain disturbance rejection and to quickly recover balance, we combine three different control approaches. As a principal building block, we generate center of mass trajectories with a linear model predictive controller that takes scheduled foo...
متن کاملFeedback Control For Cassie With Deep Reinforcement Learning
Bipedal locomotion skills are challenging to develop. Control strategies often use local linearization of the dynamics in conjunction with reduced-order abstractions to yield tractable solutions. In these model-based control strategies, the controller is often not fully aware of many details, including torque limits, joint limits, and other non-linearities that are necessarily excluded from the...
متن کاملGait Generation for a Bipedal System By Morris-Lecar Central Pattern Generator
The ability to move in complex environments is one of the most important features of humans and animals. In this work, we exploit a bio-inspired method to generate different gaits in a bipedal locomotion system. We use the 4-cell CPG model developed by Pinto [21]. This model has been established on symmetric coupling between the cells which are responsible for generating oscillatory signals. Th...
متن کاملExploiting Human Motor Skills for Training Bipedal Robots Undergraduate Honors Thesis
Although machine learning, reinforcement learning, and learning from demonstration have improved the rate and accuracy at which robots can gain intelligence from humans, they haven’t reached the rapid rate at which humans are able to acquire new knowledge. Many systems that exploit imitation learning use simple positive and negative reinforcement, and place the burden of learning completely on ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015